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RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY 
SYSTEMS 
BRANCH ^ t 




The Biotechnology Systems Branch of the Scientific and Technical Information Center 
(STIC) detected errors when processing the following CRF diskette: 



Application Serial Number: 
Art Unit / Team No. : 
Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS THE ERRORS DETECTED. 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 
BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
ARTI SHAH 703-308-4212 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



, W/o% 9 fr y 



ATTNt NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



Wrapped Nucleics 



Wrapped Aminos 



Incorrect Line Length 



4 Misaligned Amino Acid 

Numbering 

5 Non-ASCII 



Variable Length 



Wrong Designation 



8 Skipped Sequences 

(OLD RULES) 



9 Skipped Sequences 

(NEW RULES) 



10 Use of N's or Xaa's 

(NEW RULES) 



The number/text at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The amino acid number/text at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3. as this will prevent "wrapping". 

The rules require that a line not exceed 72 characters in length. This includes spaces. 
All text must be visible on page. 

The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It Is recommended to delete any tabs and uses spacing between the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) features section that some may be missing. 



Seauence(s) / f ^ contain amino acid or nucleic acid designators which are not standard 
representations as per the Sequence Rules (Please refer to paragraph 1 .822) 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(I) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xl) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence Is intentionally skipped 

Please also adjust the "(Hi) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence Id number 
<400> sequence Id number 
000 

Use of N's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 



11 



12 



13 



Use of <213>Organtsm 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 



Wrong Format 



Sequence(s) . 



are missing this mandatory field or its response. 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial" or "Unknown" 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) 
(Sec. 1.823 of new Sequence Rules) 



"File submitted was in the alphabetical heading format of the Old Sequence Rules.Tnis is invalid since the 
"Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Disclosures" 
Federal Register Notice. Vol. 63. No. 104. June 1. 1998. p. 29620 
applies to applications filed on or after July 1. 1998. 

AKS-Biotechnology Systems Branch- 7/10/98 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/096, 749 



DATE: 09/09/98 
TIME: 15:58:54 



INPUT SET: S28504.raw 



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 



- — BetaHMot Comply 
Corrected Diskette Needed 



1 






i 






i 


( 1 ) 


Gene 


4 










( 1 ) 


6 






7 




( 11 ) 


8 






9 




(HI 


10 






11 




(IV) 


12 




(A 


1 3 




(B 


14 




(C 


15 


A.. 


(D 


16 


(E 


17 




(F 


18 






19 




(V) 


20 




(A 


21 




(B 


22 




(C 


2 3 




(D 


24 






25 




(vi) 


26 




(A 


27 




(B 


28 






29 




(via 


30 




(A 


31 




(B 


32 






33 




( vii 


34 




(A 


35 




(B 


36 




(C 


37 






38 




(ix) 


39 




(A 


40 




(B 


41 






42 







SEQUENCE LISTING 



General Information 



(i) APPLICANT: Koieda, Shohei 

(ii) TITLE OF THE INVENTION: ARTIFICIAL ANTIBODY POLYPEPTIDES 

(iii) NUMBER OF SEQUENCES: 118 

(iv) CORRESPONDENCE ADDRESS: 
ADDRESSEE: Schwegman, Lundberg, Woessner & Kluth P. A. 
STREET: 121 South Eighth Street, Ste. 1600 
CITY: Minneapolis 
STATE: MN 
COUNTRY: USA 
ZIP: 55402 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0b 

CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 09/0 96,74 9 
FILING DATE: June 12, 1998 

PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 

) ATTORNEY/AGENT INFORMATION: 
NAME: Ann S. Viksnins 
REGISTRATION NUMBER: 37,748 
REFERENCE/ DOCKET NUMBER: 109.034US1 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: (612) 373-6900 
TELEFAX: (612) 339-3061 



ERRORED SEQUENCES FOLLOW: 



PAGE: 2 RAW SEQUENCE LISTING DATE: 09/09/98 

PATENT APPLICATION US/09/096,749 TIME: 15:58:55 

INPUT SET: S28S04.raw 

27 3 (2) INFORMATION FOR SEQ ID NO: 13: 
274 

275 (i) SEQUENCE CHARACTERISTICS: 

— > 276 (A) LENGTH: 59 base pairs 

277 (B) TYPE: nucleic acid 

278 (C) STRANDEDNESS: single 

279 (D) TOPOLOGY: linear 
280 

281 (ii) MOLECULE TYPE: cDNA 

282 (iii) HYPOTHETICAL: NO 

283 (iv) ANTI-SENSE: NO Lf^L -£<WV ~ XZ^C 

284 (V) FRAGMENT TYPE: /IV! 
285 
286 
287 
288 

289 CGGGATCCCA TATGCAGGTT TCTGATGTTC CGCGTGACCT GGAAGTTGTT 

— > 290 GCTGCGACC 59 ^ 

291 



(vi) ORIGINAL SOURCE: j/^^ j£ ^ ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

S9 



— > 



— > 



292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TAACTGCAGG AGCATCCCAG CTGATCAGCA GGCTAGTCGG GGTCGCAGCA 
ACAAC 55 



311 


(2) INFORMATION FOR SEQ ID 


312 




313 


(i) SEQUENCE CHARACTERISTICS: 


314 


(A) LENGTH: 51 base pairs 


315 


(B) TYPE: nucleic acid 


316 


(C) STRANDEDNESS: single 


317 


(D) TOPOLOGY: linear 


318 




319 


(ii) MOLECULE TYPE: cDNA 


320 


(iii) HYPOTHETICAL: NO 


321 


(iv) ANTI-SENSE: NO 


322 


(V) FRAGMENT TYPE: 


323 


(vi) ORIGINAL SOURCE: 



PAGE: 3 RAW SEQUENCE LISTING DATE: 09/09/98 

PATENT APPLICATION US/09/096,749 TIME: 15:58:56 



INPUT SET: S28504.raw 

324 

325 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
326 

327 CTCCTGCAGT TACCGTGCGT TATTACCGTA TCACGTACGG TGAAACCGGT G 

328 51 
329 

402 (2) INFORMATION FOR SEQ ID NO: 20: 
403 

404 (i) SEQUENCE CHARACTERISTICS: 

— > 405 (A) LENGTH: 55 base pairs 

406 (B) TYPE: nucleic acid 

407 (C) STRANDEDNESS : single 

408 (D) TOPOLOGY: linear 
409 

410 (ii) MOLECULE TYPE: cDNA 

411 (iii) HYPOTHETICAL: NO 

412 (iv) ANTI-SENSE: NO 

413 (V) FRAGMENT TYPE: 

414 (vi) ORIGINAL SOURCE: 
415 

416 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
417 

418 CGGGATCCGA GCTCGCTGGG CTGTCACCAC GGCCAGTAAC AGCGTATACA 

--> 419 GTGAT 55 
420 

529 (2) INFORMATION FOR SEQ ID NO: 27: 
530 

531 (i) SEQUENCE CHARACTERISTICS: 

— > 532 (A) LENGTH: 51 base pairs 

533 (B) TYPE: nucleic acid 

5 34 (C) STRANDEDNESS: single 

535 (D) TOPOLOGY: linear 
536 

5 37 (ii) MOLECULE TYPE: cDNA 

5 38. (iii) HYPOTHETICAL: NO 

539' (iv) ANTI-SENSE: NO 

540 (V) FRAGMENT TYPE: 

541 (vi) ORIGINAL SOURCE: 
542 

54 3 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
544 

545 GATCAGCTGG GATGCTCCTN NKNNKNNKNN KNNKTATTAC CGTATCACGT A 

546 51 
547 



548 (2) INFORMATION FOR SEQ ID NO: 28: 
549 

5 50 (i) SEQUENCE CHARACTERISTICS: 

— > 551 (A) LENGTH: 57 base pairs 

552 (B) TYPE: nucleic acid 

553 (C) STRANDEDNESS: single 



PAGE: 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/096,749 



DATE: 09/09/98 
TIME: 15:58:57 



INPUT SET: S28S04.raw 

554 (D) TOPOLOGY: linear 
555 

556 (ii) MOLECULE TYPE: cDNA 

557 (iii) HYPOTHETICAL: NO 

558 (iv) ANTI-SENSE: NO 

559 (V) FRAGMENT TYPE: 

560 (vi) ORIGINAL SOURCE: 
561 

562 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
563 

564 TGTATACGCT GTTACTGGCN NKNNKNNKNN KNNKNNKNNK TCCAAGCCAA 

— > 565 TCTCGAT 57 
566 




567 (2) INFORMATION FOR SEQ ID NO: 29: 
568 

569 (i) SEQUENCE CHARACTERISTICS: 

— > 570 (A) LENGTH: 47 base pairs 

571 (B) TYPE: nucleic acid 

57 2 (C) STRANDEDNESS : single 

573 (D) TOPOLOGY: linear 
574 

575 (ii) MOLECULE TYPE: cDNA 

576 (iii) HYPOTHETICAL: NO 

577 (iv) ANTI-SENSE: NO 

578 (V) FRAGMENT TYPE: 

579 r (vi) ORIGINAL SOURCE: 

580 ' 

581 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
582 

583 CTGTATACGC TGTTACTGGC NNKNNKNNKN NKCCAGCGAG CTCCAAG 

584 47 
585 




586 (2) INFORMATION FOR SEQ ID NO: 30: 
587 

588 (i) SEQUENCE CHARACTERISTICS: 

— > 589 (A) LENGTH: 51 base pairs 

590 (B) TYPE: nucleic acid 

5 91 (C) STRANDEDNESS: single 

592 (D) TOPOLOGY: linear 
593 

5 94 (ii) MOLECULE TYPE: cDNA 

5 95 (iii) HYPOTHETICAL: NO 

5 96 (iv) ANTI-SENSE: NO 

5 97 (v) FRAGMENT TYPE: 

5 98 (vi) ORIGINAL SOURCE: 
599 

600 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
601 

602 CATCACTGTA TACGCTGTTA CTNNKNNKNN KNNKNNKTCC AAGCCAATCT C 

603 51 




PAGE: 5 RAW SEQUENCE LISTING DATE: 09/09/98 

PATENT APPLICATION US/09/096,749 TIME: 15:58:57 

INPUT SET: S28504.raw 

2 099 (2) INFORMATION FOR SEQ ID NO: 110: 
2100 

2101 (i) SEQUENCE CHARACTERISTICS: 

— > 2102 (A) LENGTH: 94 amino acids 

2103 (B) TYPE: amino acid 

2104 (C) STRANDEDNESS : single 

2105 (D) TOPOLOGY: linear 
2106 

2107 (ii) MOLECULE TYPE: peptide 

2108 (iii) HYPOTHETICAL: NO 

2109 (iv) ANTI-SENSE: NO 

2110 (v) FRAGMENT TYPE: internal 

2111 (vi) ORIGINAL SOURCE: 
2112 

2113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
2114 
2115 
2116 

2117 15 10 15 
2118 

2119 20 25 /^"^ 30 35 

— > 2120 Thr Gly Gly Asn Ser Pro Val(Oxx/Glu Phe Thr Val Pro Gly Ser Lys Ser Thr Ala 



Val 


Ser 


Asp 


Val 


Pro 


Arg 


Asp 


Leu 


1 








5 








He 


Ser 


Trp 


Asp 


Ala 


Pro 


Ala 


Val 


20 










25 






Thr 


Gly Gly 


Asn 


Ser 


Pro 


Val( 


Oxx 




40 










45 




Thr 


He 


Ser 


Gly 


Leu 


Lys 


Pro 


Gly 






6& 










65 / 


Gly Arg 


Gly 


Asp 


Ser 


Pro 


Ala 


Serf 








80 











2121 40 45 50 55 

2122 

2123 SI 65 / 70 75 

2124 
2125 
2126 



: J. jjyo riu lie jcl 11c now iyi y 1 111 



2127 (2) INFORMATION FOR SEQ ID NO: 111: 
2128 

212 9 (i) SEQUENCE CHARACTERISTICS: 

— > 2130 (A) LENGTH: 248 base pairs ~\ 

2131 (B) TYPE: nucleic acid / A /Cs^**-' 

2132 (C) STRANDEDNESS: single \ '■ u ^ * 

2133 (D) TOPOLOGY: linear 



zijj ( u ) tofuluuy: linear r> ^ 

2134 / (/y Iswrv 

214 3 CTGCTGATCA GCTGGGATGG TCCTGCA0TT ACCGTGCGTT ATTACCGTAT _* Ax/iJLM 0^S&- 

— ^ ; 

2146 GAAACCGGTG GTAACTCCCC GGTTCAGGAA TTCACTGTAC CTGGTTCCAA T~ J 4 l X*JLt' 

2147 GTCTACTGCT JvV-^A. _ 



2135 (ii) MOLECULE TYPE: cDNA 

2136 (iii) HYPOTHETICAL: NO 

2137 (iv) ANTI-SENSE: NO 

2138 (V) FRAGMENT TYPE: 

2139 (vi) ORIGINAL SOURCE: 
2140 

2141 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
2142 

CTGCTGATCA GCTGGGATGG TCCTGCA0TT ACCGTGCGTT ATTACCGTAT /jji- 

2144 CACGTACGGT___ ^ Qu^jJ^X 



2147 GTCTACTGCT 
2148 

214 9 ACCATCAGCG GCCTGAAACC GGGTGTCGAC TATACCATCA CTGTATACGC 



• 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/096,749 



DATE: 09/09/98 
TIME: 15:58:58 



INPUT SET: S28S04.raw 



2150 TGTTACTGGC 
2151 

2152 CGTGGTGACA GCCCAGCGAG CTCCAAGCCA ATCTCGATTA ACTACCGTAC 

2153 CTAGTAACTC 
2154 

— > 2155 GAGGATCC 248 



PAGE: l SEQUENCE VERIFICATION REPORT DATE: 09/09/98 

PATENT APPLICATION US/09/096,749 TIME: 15:59:02 

INPUT SET: S28504.raw 



Line 


Error 


Original Text 


276 


Entered (59) and Calc. Seq. Length (9) differ 


(A) LENGTH: 59 base pairs 


290 


# of Sequences for line conflicts w/ running total 


GCTGCGACC 59 


295 


Entered (55) and Calc. Seq. Length (5) differ 


(A) LENGTH: 55 base pairs 


309 


# of Sequences for line conflicts w/ running total 


ACAAC 55 


314 


Entered (51) and Calc. Seq. Length (0) differ 


(A) LENGTH: 51 base pairs 


405 


Entered (55) and Calc. Seq. Length (5) differ 


(A) LENGTH: 55 base pairs 


419 


# of Sequences for line conflicts w/ running total 


GTGAT 55 


532 


Entered (51) and Calc. Seq. Length (0) differ 


(A) LENGTH: 51 base pairs 


551 


Entered (57) and Calc. Seq. Length (7) differ 


(A) LENGTH: 57 base pairs 


565 


# of Sequences for line conflicts w/ running total 


TCTCGAT 57 


570 


Entered (47) and Calc. Seq. Length (0) differ 


(A) LENGTH: 47 base pairs 


589 


Entered (51) and Calc. Seq. Length (0) differ 


(A) LENGTH: 51 base pairs 


2102 


Entered (94) and Calc. Seq. Length (93) differ 


(A) LENGTH: 94 amino acids 


2120 


Wrong Amino Acid Designator 


Thr Gly Gly Asn Ser Pro Val Oxx Glu Phe Thr Val Pro Gly 


2130 


Entered (248) and Calc. Seq. Length (8) differ 


(A) LENGTH : 248 base pairs 


2155 


# of Sequences for line conflicts w/ running total 


GAGGATCC 




Notice of Availability of 
Checker Program 

Applicant Aid for Biotechnology Computer Readable Form (CRF) 
Sequence Listing Submissions 

The Patent and Trademark Office (PTO) has developed a computer program, called Checker, that 
will aid applicants in identifying and correcting errors prior to making submissions for compliance 
with the Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid 
Sequence Disclosures (Sequence Rules: 37CRF 1.821 through 1.825). 

Final rules were published in the Federal Register (55 FR18230) on May 1, 1990, and in the PTO 
Official Gazette (1 1 14 Off.Gaz.PatOffice 29) on May 15, 1990. 

Checker is a DOS-based software program that is intended to assist users in determining whether 
errors may be present in the sequence listings, and is not intended to guarantee that the submission 
is error-free. 

The most current version of the software is available via computer downloading, details are below. 
Copies on diskette are also available. Updated software versions will not be automatically mailed 
out; any updates will be announced in the PTO Official Gazette. 

The software can be accessed/requested from the following locations: 

1) Dial-up access through the Internet. Location is ftp://ftp.uspto.gov 
The software is in current directory: pub/checker/ 
Download all the files. Cost: Free-of-charge 

3) For diskette copies, mail to: U.S.P.T.O., OEIP, CRYSTAL PARK 3, SUITE 44 1 

WASHINGTON DC 20231 

COST FOR DISKETTE IS $ 25.00 

METHOD OF PAYMENT: 

Check payable to Commissioner of Patents and Trademarks 
VISA/ Mastercard/ Charge- Charges can be faxed to 703-306-2737 
PTO Deposit Account 



For Further Information, Contact: Arti Shall at 703-308-4212 



